OpenAI’s algorithm ‘Q*’ gives new direction to mathematical reasoning
‘Artificial general intelligence’ (AGI) has broad reasoning capabilities comparable to those of a human
image for illustrative purpose

Many of the techniques implemented by AI algorithms ultimately boils down to a mathematical area known as matrix algebra. A matrix is simply a grid of numbers, of which a digital image is a familiar example
The recent sacking and almost immediate rehiring of Sam Altman by OpenAI have once again sparked-off debates around the development and use of artificial intelligence (AI). What's more unusual is that a prominent theme in media reporting has been the ability of AI systems to do maths.
Apparently, some of the drama at OpenAI was related to the company's development of a new AI algorithm called Q*. The system has been talked about as a significant advancement and one of its salient features was a capability to reason mathematically. But isn't mathematics, the foundation of AI? How can an AI system have trouble with mathematical reasoning, given that computers and calculators can perform mathematical tasks?
AI is not a single entity. It's a patchwork of strategies for performing computation without the direct instruction of humans. As we'll see, some AI systems are competent at maths. However, one of the most important current technologies, the large language models (LLMs) behind AI chatbots such as ChatGPT, has struggled so far to emulate mathematical reasoning. This is because they have been designed to concentrate on language.
If the new Q* algorithm can solve unseen mathematical problems, then that might well be a significant breakthrough. Mathematics is an ancient form of human reasoning that large language models have so far struggled to emulate. LLMs are the technology that underlies systems like OpenAI's ChatGPT. At the time of writing, the details of the Q* algorithm and its capabilities are limited, but highly intriguing.
There are various subtleties to be taken into consideration before deeming Q* a success. These AI systems could be described as competent at maths. However, it's likely that Q* is not being used to help academics in their work but rather is intended for another purpose.
Nevertheless, even if Q* is incapable of pushing the boundaries of cutting-edge research, quite likely that there is some significance in the way it has been built. This could raise tantalising opportunities for future development.
Increasingly comfortable: As a society, we are increasingly comfortable with specialist AI being used to solve a multitude of predetermined problem. For example, digital assistants, facial recognition and online recommendation systems will be familiar to most people. What remains elusive is a so-called “artificial general intelligence” (AGI) that has broad reasoning capabilities comparable to those of a human. Mathematics is a basic skill that we aspire to teach to every school kid and qualifies as a fundamental milestone in the search for AGI. So how else would mathematically-competent AI systems be of help to society?
The mathematical mindset is relevant to numerous applications like coding and engineering. On that count, mathematical reasoning is a vital transferable skill for both human and artificial intelligence. One irony is that, at a fundamental level, AI is based on mathematics.
For example, many of the techniques implemented by AI algorithms ultimately boil down to a mathematical area known as matrix algebra. A matrix is simply a grid of numbers, of which a digital image is a familiar example. Each pixel is nothing more than numerical data. Large language models are also inherently mathematical. Based on a huge sample of text, a machine can learn the probabilities for the words that are most likely to follow a prompt (or question) from the user to the chatbot. If you want a pre-trained LLM to specialise in a particular topic, then it can be fine-tuned on mathematical literature, or any other domain of learning. LLMs can generate text that reads as if it understands mathematics. Unfortunately, doing so produces a LLM that is good at bluffing, but bad at detail. The issue is that a mathematical statement is, by definition, one that may be assigned an unambiguous Boolean value (that is, true or false). Mathematical reasoning amounts to the logical deduction of new mathematical statements from those previously established.
Devil's advocate: Naturally, any approach to mathematical reasoning that relies on linguistic probabilities is going to be driving outside its lane. One way around this could be to incorporate a system of formal verification into the architecture (exactly how the LLM is built), which continuously checks the logic behind the leaps made by the large language model.
A clue that this has been done could be in the name Q*, which could plausibly refer to an algorithm developed all the way back in the 1970s to help with deductive reasoning. Alternatively, Q* could refer to Q-learning, in which a model can improve over time by testing for and rewarding conclusions that are correct. But several challenges exist to building mathematically able AIs. For instance, some of the most interesting mathematics consists of highly unlikely events.
There are many situations in which one may think that a pattern exists based on small numbers but it unexpectedly breaks down when one checks enough cases. This capability is difficult to incorporate into a machine. Another challenge may come as a surprise: mathematical research can be highly creative. It has to be because practitioners need to invent new concepts and yet stick within the formal rules of an ancient subject. Any AI methodology trained only to find patterns in pre-existing mathematics could presumably never create genuinely new mathematics. Given the pipeline between mathematics and technology, this seems to preclude the conception of new technological revolutions. But let's play devil's advocate for a moment, and imagine whether AI could indeed create new mathematics.
The previous argument against this has a flaw, in that it could also be said that the best human mathematicians were also trained exclusively on pre-existing mathematics. Large language models have surprised us before, and will do so again.
(The writer is associated with University of Westminster)








